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Real Par*v Tn Tntcrcst 

The real party in interest in this appeal is INTERN ATIONAL BUSINESS 
MACHINES CORPORATION, a corporation of New York having a place of business at New 
Orchard Road, Armonk, New York 10504. 



PACE 3/22 * RCVD AT 1/14/2008 9:08:36 PM [Eastern Standard Time] • SVR:USPTO-EFXRF-SB " DNIS:2738300 * CSID:15137720052 * DURATION (mm-ss):06-48 



Jan 14 08 09:09p 



Humphrey 



15137720052 



p. 4 



Related Appeals and Interferences 

There are no such appeals or interferences. 
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JAN 142008 



Status of Claims 

Total Number of Claim s in the Application 

Claims in the application are: 23 

Status of all the Claims 

1. Claims canceled: NONE 

2. Claims withdrawn from consideration but not canceled: NONE 

3. Claims objected to: NONE 

4. Claims allowed or confirmed: NONE 

5. Claims rejected: 1-23 



Claims on Appeal 

The claims are appeal are Claims 1-23. The claims in this application have never been 
amended. 



-4- 



PACE 5(22* RCVD AT 1/14/2008 9:08:36 PM [Eastern Standard Time] • SVR:USPTO-EFXRF-5/6 • DNIS:2738300* CSID:1 51 37720052 * DURATION (mm-ss):06-48 



Jan 14 08 09:10p Humphrey 15137720052 p,B 



Status of Amendments 

An Amendment of Appeal filed with this brief, correcting typographical errors and 

omissions in the specification, is pending. 
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Summary of Claimed S ubject Matter as to Independent Claim 1 

The subject matter of independent Claim 1 is described in the specification on page 9. 
line 19 to page 12, line 1 1 , and Figs. 4-5, reference numbers 200-350 of the drawings. 

These pages disclose examples of the invention recited in claim t, which describes a 
method that utilizes "a data storage structure storing character statistics on an attribute" (see 
Fig. 4 and the description thereof at page 9 lines 22-25 and page 10 lines 1-9), comprising "a 
first structure storing, for each of a plurality of character positions, frequently occurring 
characters in that character position" (see in Fig. 4, die storage of frequently occurring 
characters lor positions 1, 2, 3 and 4). This structure is used in conjunction with a "pattern 
matching predicate" (e.g., as noted at page 12. line 12 to page 14 line 20, which explains the 
process in conjunction with Fig. 5, and the example at page 14 line 21 - page 16 line 6 relating 
to the pattern match to the string "BIB"). As recited in the claim, pattern matching with the 
structure proceeds by the use of "character positions of characters in [the] pattern matching 
predicate" as can be seen in the process of Fig. 6 and the example noted above. 

Thus, the invention is directed to the storage of statistical information regarding 
characters in an attribute of a database relation, and the use of those statistics in evaluating a 
pattern matching predicate. 
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Summary of Claimed S ub ject Matter as to Independent C laim 11 

Independent Claim 1 1 is directed to a computer system, such as that described in the 
specification at page 5 line 4 through page 9 line 1 8 : which implements a process as described 
on page 9, line 19 to page 12, line 1 1, and Figs. 4-5, reference numbers 200-350 of the 
drawings. 

The latter pages disclose examples.of the invention recited in claim 1 1, which 
describes an apparatus that utilizes "a data storage structure storing character statistics on an 
attribute" (see Fig. 4 and the description thereof at page 9 lines 22-25 and page 10 lines 1-9), 
comprising "a first structure storing, for each of a plurality of character positions, frequently 
occurring characters in that character position" (see in Fig. 4, the storage of frequently 
occurring characters for positions 1, 2, 3 and 4). This structure is used in conjunction with a 
"pattern matching predicate" (e.g.. as noted at page 12. line 12 to page 14 line 20, which 
explains the process in conjunction with Fig. 5, and the example at page 14 line 21 - page 16 
line 6 relating to the pattern match to the string "BIB"). As recited in the claim, pattern 
matching with the structure proceeds by the use of "character positions of characters in [the] 
pattern matching predicate" as can be seen in the process of Fig. 6 and the example noted 
above. 

Thus, the invention of claim 1 1 is also directed to the storage of statistical information 
regarding characters in an attribute of a database relation, and the use of those statistics in 
evaluating a pattern matching predicate. 
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Summary of Claimed Subject Matter a s to In de pe ndent Claim 21 

Independent Claim 21 is direeted to a program product, implementing a relational 
database system such as described in the specification on page 9, line 19 to page 12, line 11, 
and Figs. 4-5 , reference numbers 200-350 of the drawings. 

These pages disclose examples of the invention recited in claim 21, which describes a 
program product that utilizes "a data storage structure storing character statistics on an 
attribute" (see Fig. 4 and the description thereof at page 9 lines 22-25 and page 10 lines 1-9), 
compri sing "a first structure storing, for each of a plurality of character positions, frequently 
occurring characters in that character position" (see in Fig. 4, the storage of frequently 
occurring characters for positions 1,2,3 and 4). This structure is used in conjunction with a 
"pattern matching predicate" (e.g., as noted at page 12, line 12 to page 14 line 20, which 
explains the process in conjunction with Fig. 5, and the example at page 14 line 21 - page 16 
line 6 relating to the pattern match to the suing "BIB"). As recited in the claim, pattern 
matchi ng with the structure proceeds by the use of "character positions of characters in [the] 
pattern matching predicate" as can be seen in the process of Fig. 6 and the example noted 
above. 

Thus, again, claim 21 is directed to the storage of statistical information regarding 
characters in an attribute of a database relation, and die use of those statistics in evaluating a 
pattern matching predicate. 



-8- 



PACE 9/22 * RCVD AT 1/14/2008 9:08:36 PM [Eastern Standard Time] ' SVR:USPTO-EFXRF-5/8 ' DNIS:2738300 * CSID:1 51 37720052 * DURATION (mm-ss):0e-48 



Jan 14 08 09: lip Humphrey 



15137720052 



p, 10 



Grounds of Rejection under 36 IJ.S.C* 103fa> 

Claims 1-23 are rejected under 35 U.S.C. 103(a) as being unpatentable over Fritchman 
(US Patent No. 6,785,677 Bl 7 filed: May 2 ? 2001) in view of Haas et al. (Haas hereinafter) 
(Non-Patent Literature: Sampling - Based Selectivity for Joins Using Augmented Frequent 
Value Statistics, IBM ALmaden Research Center, San Jose CA, 1995, IEEE). 
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Argument 

The Examiners rejection is premised upon the assertion that the Fritchman '677 
patent discloses storing character structures that meet the claim language. The claim language 
recites a structure in which, "for each of a plurality of character positions" the structure 
identi fies "frequently occurring characters in that character position, and statistics for each 
frequently occurring character". Fritchman does not disclose this. Fritchman, in the cited 
passages, only discloses how to describe a character string criterion for a query, indicating that 
strings should be described using a prefix segment, a suffix segment, and one or more interior 
segments. 

The Examiner has cited to passages in col. 8 of Fritchman regarding a "bit vector 
noting the position of any single-character wild cards" (that can be included in a LIKE 
predicate of a query). The Examiner seems to believe that the representation of a LIKE 
predicate using wildcard characters, is somehow the same as a statistical representation of 
frequently occurring characters in particular character locations in an attribute. They are 
clearly not the same. A wildcard is a way to identify a pattern of characters that match a 
criterion, not a way of identifying, for an attribute of a relation, frequently occurring 
characters in a character position. Indeed, Fritchman does not disclose the generation of any 
such statistical information, or the use of the same, but rather discloses a matching 
methodology in which a "pattern is applied to each database value in turn, where the value is 
the result of evaluating the first operand of the LIKE predicate for each candidate row 
retrieved from the database." Clearly, the application of a pattern to every database value to 
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determine exactly how many matches exist, is entirely different from using characterizing data 
to estimate a number of matches. 

The Examiner asserts the Haas paper discloses storing character statistics on an 
attribute, and statistics for each frequently occurring character. However, the Examiner is 
incorrect here as well. The Examiner asserts that page 528 of Haas discloses "storing 
character statistics on an attribute 7 ' and "statistics for each frequently occurring character". 
Applicant is unable to identify any such disclosure. Specifically, the text quoted in Haas is 
directed to sampling-based selectivity estimation, using what are called 'augmented frequent 
value' (AFV) statistics. While selectivity estimation and the use of frequent value statistics is 
somewhat relevant to the present invention, Applicant has been unable to identify any 
disclosure in the cited paragraphs of the creation of an index or statistic for text matching, that 
stores 'Tor each of a plurality of character positions, frequently occurring characters in that 
character position", as claimed. The quoted paragraphs on page 528 only generally discuss 
the creation of AFV statistics and do not detail a particular form of such statistics for text 
matching, much less the particular form that is recited in the present claims, namely, "for each 
of a plurality of character positions, frequently occurring characters in that character position." 

As the Examiner has not identified how either cited reference relied upon in the 
rejection discloses a index or statistic meeting the claim recitations, namely, 'Tor each of a 
plurality of character positions, frequently occurring characters in that character position," 
Applicant submits that all claims presented herein are allowable and requests the issuance of a 
Notice of Allowability. 
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The Examiner attention is directed to an Amendment for Appeal attached hereto which 
corrects typographical errors noted by Applicant during the preparation of this appeal brief. 
Applicant submits that these amendments do not introduce new matter or new issues and 
respectfully requests their entry. 

In accordance with the foregoing, Applicant submits that the Examiner's rejection is in 
error and a reversal of the rejection and allowance of the claims is therefore requested. 



Respectfully submitted. 
Wood, Herron & Evans, L.L.P. 




Thomas W. Humphrey 
Reg. No. 34353 

2700 Carew Tower 
441 Vine Street 
Cincinnati, OH 45202-29 1 7 

Voice: (513)241-2324 
Facsimile: (513)241-6234 
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RECEIVED 
CENTRAL FAX CENTER 

JAN 1 if 2008 

Claim Appendix 

1 . (original) A method for generating a statistic for a pattern matching 
predicate on an attribute of a relation, to be used in optimizing execution of a 
query directed to one or more attributes of said relation, comprising 

providing a data storage structure storing character statistics on said 
attribute, comprising a first structure storing, for each of a plurality of character 
positions., frequently occurring characters in that character position, and statistics 
for each frequently occurring character, 

retrieving said statistics in response to said pattern matching predicate 
based upon the character positions of characters i n said pattern matching 
predicate, and 

generating said statistic based upon said retrieved character statistics. 

2. (original) The method of claim 1 wherein said data storage structure further 
stores a count of a number of occurrences of a frequently occurring character. 

3. (original) The method of claim 1 wherein said data storage structure further 
comprises a second structure storing frequently occurring characters that are 
subsequent to the frequently occurring characters stored in said first structure. 

4. (original) The method of claim 3 wherein said second data storage structure 
further stores a probability of occurrence of a frequently occurring subsequent 
character. 

-13- 
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5. (original) The method of claim 1 wherein generating said statistic based 
upon said retrieved character statistics comprises estimating a statistic for a 
desired character and desired character position, in the event said statistic is not 
stored in said first structure. 



6, (original) The method of claim 5 wherein said estimating comprises 
accumulating statistics for said desired character position, for characters other 
than said desired character, and calculating average frequency of occurrence of 
characters that do not have statistics stored in said first structure. 



7. (original) The method of claim 1 wherein said first structure is a table. 

8. (original) The method of claim 1 wherein said first structure comprises a 
linked list. 



9. (original) The method of claim 3 wherein said second structure comprises a 
table. 

1 0. (original) The method of claim 3 wherein said second structure comprises a 
linked list. 

1 1 . (original) A computer system implementing a relational database system 
and generating a statistic for a pattern matching predicate on an attribute of a 
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relation of said relational database, to be used in optimizing execution of a query 
directed to one or more attributes of said relation, comprising 

storage for said relational database, including a relation having a plurality 
of tuples including values for a plurality of attributes, and a data storage 

structure storing character statistics on an attribute, comprising a first structure 

storing, for each of a plurality of character positions, frequently occurring 

characters in that character position, and statistics for each frequently occurring 

character, and 

computing circuitry performing query optimization and query execution 
upon said relational database, said query optimization including generating said 
statistic for an attribute of said relation by retrieving said statistics in response to 
said pattern matching predicate based upon the character positions of characters 
in said pattern matching predicate, and generating said statistic based upon said 
retrieved character statistics. 

12. (original) The computer system of claim 1 1 wherein said data storage 
structure further stores a count of a number of occurrences of a frequently 
occurring character, 

13. (original) The computer system of claim 1 1 wherein said data storage 
structure further comprises a second structure storing frequently occurring 
characters that are subsequent to the frequently occurring characters stored in 
said first structure. 
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14. (original) The computer system of claim 1 3 wherein said second data 
storage structure further stores a probability of occurrence of a frequently 
occurring subsequent character. 

15. (original) The computer system of claim 1 1 wherein said computing 

circuitry generates said statistic based upon said retrieved character statistics by 

estimating a statistic for a desired character and desired character position, in the 
event said statistic is not stored in said first structure. 

16. (original) The computer system of claim 15 wherein said computer circuitry 
performs said estimating by accumulating statistics for said desired character 
position, for characters other than said desired character, and calculating average 
frequency of occurrence of characters that do not have statistics stored in said 
first structure. 

17. (original) The computer system of claim 1 1 wherein said first structure is a 
table. 

18. (original) The computer system of claim 1 1 wherein said first structure 
comprises a linked list. 

19. (original) The computer system of claim 1 3 wherein said second structure 
comprises a table. 
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20. (original) The computer system of claim 13 wherein said second structure 
comprises a linked list. 

21 . (original) A program product for implementing a relationa] database system 
and generating a statistic for a pattern matching predicate on an attribute of a 
relation of said relational database, to be used in optimizing execution of a query 
directed to one or more attributes of said relation, comprising 

a relational database, including a relation having a plurality of tuples 
including values for a plurality of attributes, 

a data storage structure storing character statistics on an attribute, 
comprising a first structure storing, for each of a plurality of character positions, 
frequently occurring characters in that character position, and statistics for each 
frequently occurring character, and 

relational database software performing query optimization and query 
execution upon said relational database, said query optimization including 
generating a statistic for an attribute of said relation by retrieving said statistics 
in response to said pattern matching predicate based upon the character positions 
of characters in said pattern matching predicate, and generating said statistic 
based upon said retrieved character statistics, and 

a signal bearing media holding said relational database and relational 
database software. 
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22. (original) The program product of claim 21 wherein the signal bearing 
media comprises transmission media. 

23. (original) The program product of claim 21 wherein the signal bearing 
media comprises recordable media. 
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Evidence Appendix 

None. 
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Related Proceedings Appendix 

None. 
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